Finding Transcription Factor Binding Sites in Coregulated Genes by Exhaustive Sequence Search
Abstract
Growing amounts of gene expression data make it possible to find coregulated genes by clustering methods. By analysing the promoter regions of these genes, rather weak signals of transcription factor binding sites may be detected [Zhang, 1999]. We compare existing programs and our own software on yeast clusters. To this end, we introduce the new algorithm ITB, an integrated tool for box finding, which exhaustively analyses regular expression-like patterns in promoter sequences, allowing gaps and the matching of more than one base at any position within the candidates. We also evaluate the applicability of ITB to predicting transcription factor binding sites in human promoter sequences.
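The kind of search the abstract describes can be illustrated with a minimal sketch. It is not the ITB implementation: the function names, the choice of IUPAC ambiguity codes to express "more than one base at a position", the use of `N` as a one-base gap, and the ranking by raw sequence support are all assumptions made for illustration. The abstract does not specify ITB's pattern alphabet or its scoring.

```python
import re
from itertools import product

# Assumed pattern alphabet: single bases, two-base IUPAC ambiguity codes,
# and N, which matches any base and so acts as a one-base gap.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "[AG]", "Y": "[CT]", "S": "[CG]", "W": "[AT]",
    "K": "[GT]", "M": "[AC]", "N": "[ACGT]",
}


def pattern_to_regex(pattern):
    """Translate a degenerate pattern such as 'GAYGTC' into a compiled regex."""
    return re.compile("".join(IUPAC[symbol] for symbol in pattern))


def count_supporting_sequences(pattern, sequences):
    """Count how many sequences contain at least one match of the pattern."""
    regex = pattern_to_regex(pattern)
    return sum(1 for seq in sequences if regex.search(seq))


def exhaustive_search(sequences, width, min_support):
    """Enumerate every pattern of the given width and keep the supported ones.

    Patterns that start or end with a gap are skipped, because they are
    equivalent to shorter patterns. Results are sorted by descending support,
    then alphabetically.
    """
    hits = []
    for symbols in product(IUPAC, repeat=width):
        if symbols[0] == "N" or symbols[-1] == "N":
            continue
        pattern = "".join(symbols)
        support = count_supporting_sequences(pattern, sequences)
        if support >= min_support:
            hits.append((pattern, support))
    hits.sort(key=lambda hit: (-hit[1], hit[0]))
    return hits
```

For three toy promoters `TTGACGTCATT`, `CCGATGTCAGG`, and `AAGACGTCTAA`, the degenerate pattern `GAYGTC` is found in all three, whereas the exact word `GACGTC` is found in only two. Raw support favours highly degenerate patterns, so a practical tool would score candidates against a background model rather than rank by support alone. The search space also grows as 11^width under this alphabet, which limits the sketch to short patterns.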
Similar resources
Combining frequency and positional information to predict transcription factor binding sites
MOTIVATION: Even though a number of genome projects have been finished on the sequence level, still only a small proportion of DNA regulatory elements has been identified. Growing amounts of gene expression data provide the possibility of finding coregulated genes by clustering methods. By analysis of the promoter regions of those genes, rather weak signals of transcription factor binding sites...
Finding Transcription Factor Binding Motifs for Coregulated Genes by Combining Sequence Overrepresentation with Cross-Species Conservation
Novel computational methods for finding transcription factor binding motifs have long been sought because identifying them experimentally is tedious. However, the currently prevailing methods yield a large number of false positive predictions due to the short, variable nature of transcription factor binding sites (TFBSs). We propose here a method that combines sequence overrepresentation an...
A Statistical Method for Finding Transcription Factor Binding Sites
Understanding the mechanisms that determine the regulation of gene expression is an important and challenging problem. A fundamental subproblem is to identify DNA-binding sites for unknown regulatory factors, given a collection of genes believed to be coregulated, and given the noncoding DNA sequences near those genes. We present an enumerative statistical method for identifying good candidates...
Finding motifs from all sequences with and without binding sites
MOTIVATION: Finding common patterns, or motifs, in a set of promoter regions of coregulated genes is an important problem in molecular biology. Most existing motif-finding algorithms consider a set of sequences bound by the transcription factor as the only input. However, we can get better results by considering sequences that are not bound by the transcription factor as an additional input. RE...
Differential Expression of Alpha S1 Casein and Beta-Lactoglobulin Genes at Different Physiological stages of the Adani Goats Mammary Glands
Background: Milk protein genes have been the focus of research as candidate target genes that play a decisive role in animal breeding. Objectives: In the present study, the transcriptional levels of the Beta-lactoglobulin (BLG) and Alpha S1 casein (CSN1S1) genes were investigated during the prenatal, milking, and drying periods in the mammary glands of Adani goats which showed...